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Cysteine biosynthesis is a potential target for drug development against parasitic 
Leishmania species; these protozoa are responsible for a range of serious 
diseases. To improve understanding of this aspect of Leishmania biology, a 
crystallographic and biochemical study of L. major cysteine synthase has been 
undertaken, seeking to understand its structure, enzyme activity and modes 
of inhibition. Active enzyme was purified, assayed and crystallized in an 
orthorhombic form with a dimer in the asymmetric unit. Diffraction data 
extending to 1.8 A resolution were measured and the structure was solved by 
molecular replacement. A fragment of y-poly-D-glutamic acid, a constituent of 
the crystallization mixture, was bound in the enzyme active site. Although a 
D-glutamate tetrapeptide had insignificant inhibitory activity, the enzyme 
was competitively inhibited (K { = 4 \iM) by DYVI, a peptide based on the 
C-terminus of the partner serine acetyltransferase with which the enzyme forms 
a complex. The structure surprisingly revealed that the cofactor pyridoxal 
phosphate had been lost during crystallization. 
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1. Introduction 

Leishmania, a widespread and important protozoan pathogen of 
humans and animals, requires cysteine for protein biosynthesis and 
as a precursor of trypanothione, a glutathione-spermidine conjugate 
unique to trypanosomatids with an essential role in redox metabolism 
and antioxidant defence (Krauth-Siegel & Comini, 2008). Cysteine is 
also the source of reduced sulfur for the biosynthesis of important 
metabolites such as coenzyme A, enzyme cofactors and iron-sulfur 
clusters (Nozaki et al, 2005). The vital role of cysteine raises the 
questions of how Leishmania obtains the amino acid, how cysteine 
metabolism in Leishmania might differ from that in the mammalian 
host and whether such differences might be targeted in drug- 
discovery research. L. major does not have a high-affinity transporter 
for the uptake of cysteine, but it can acquire methionine and, like the 
mammalian host, it has the enzymes required to convert methionine 
to cysteine by transsulfuration (Williams et al., 2009). The parasite can 
also produce cysteine from serine in a two-step process (Williams et 
al., 2009). Firstly, serine acetyltransferase (SAT) generates O-acetyl- 
serine (OAS) to supply the substrate for the second stage, which 
is catalyzed by the pyridoxal phosphate (PLP)-dependent cysteine 
synthase (CS; EC 2.5.1.47). This de novo pathway for cysteine 
biosynthesis is found in plants, bacteria and some protozoa, but is 
absent from mammals. In principle, L. major CS (LraCS) may 
represent a drug target, and an improved understanding of the 
enzyme might usefully inform on its potential in this respect. In 
particular, knowledge of the structure can support the development 
of reagents to chemically validate the target or to provide early-stage 
information on inhibitors (Hunter, 2009). 

Some types of CS, including bacterial O-acetylserine sulfhydrylase 
type A (OASS-A) and plant O-acetylserine thiol-lyase (OAS-TL), 
combine reversibly with SAT to form a bi- enzyme complex in which 
SAT is active and CS is strongly inhibited (Campanini et al, 2005). 
The substrates of CS are effectors of complex formation; the complex 
is dissociated by elevated levels of OAS but is stabilized by sulfide. 
The complexes formed in plants and bacteria have distinctive features 
that indicate different regulatory functions (Salsi, Campanini et al, 
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2010; Wirtz et al, 2010). It has been established that the C-terminal 
end of SAT is critical for its interaction with CS and, in particular, 
all SATs possess a C-terminal isoleucine which is essential for CS 
binding. Peptides corresponding to the C-terminus of SAT bind to the 
active site of CS and structural data have revealed that the 
carboxylate group of the C-terminal isoleucine occupies the same 
space and makes the same interactions as the carboxylate of the 
a-aminoacrylate catalytic intermediate formed after ^-elimination of 
acetate from the substrate OAS (Rabeh & Cook, 2004; Huang et al, 
2005; Francois et al, 2006; Schnell et al, 2007; Salsi, Bayden et al., 
2010). A four-amino-acid SAT peptide has been shown to be a 
competitive inhibitor of Mycobacterium tuberculosis CS with a K { of 
5 \xM, providing a simple mechanism for complex formation and its 
dissociation in the presence of elevated levels of OAS (Schnell et al, 
2007). Sequence alignments indicate that LraCS contains a SAT- 
binding motif that was originally identified in Arabidopsis thaliana 
OAS-TL O^OAS-TL; Bonner et. al, 2005) and the enzyme can also 
bind SAT when the proteins are co-expressed in Escherichia coli 
(Williams et al, 2009). 

We undertook a crystallographic and biochemical study of LraCS 
to investigate the interactions of the enzyme with ligands, including 
potential inhibitors. Our overall aim was to improve understanding of 
the enzyme in Leishmania and to provide information that might 
help to assess the potential of CS as a target for structure-based 
approaches to develop inhibitors with suitable chemical properties to 
underpin early-stage drug discovery (Hunter, 2009). 



2. Materials and methods 

2.1. Protein expression, purification and crystallization 

The recombinant E. coli expression system for LraCS (Williams et 
al, 2009) was modified by subcloning the LmCS gene from vector 
pET21a+ into pET15bTEV to allow production of an N- terminally 
His-tagged protein, which was purified following a standard protocol 
(Bond et al, 2001). Briefly, the first stage involved nickel ion-affinity 
chromatography through a 5 ml Ni-NTA column (Qiagen). The 
product was eluted in a linear imidazole-concentration gradient, 
which was followed by incubation for 2 h with His-tagged tobacco 
etch virus (TEV) protease at 303 K prior to dialysis at room 
temperature against 20 mM Tris-HCl, 150 mM NaCl pH 7.5 for 1 h. 
The resulting mixture was reapplied onto the Ni-NTA column, which 
binds the cleaved His tag, the TEV protease and any remaining 
uncleaved LraCS. The LraCS from which the His tag had been 
cleaved was present in the flowthrough. Fractions were analyzed 
using SDS-PAGE and those containing LraCS were pooled. The 
protein was further purified by size-exclusion chromatography using 
a Superdex 200 26/60 column (GE Healthcare) equilibrated with 
20 mM Tris-HCl, 150 mM NaCl pH 7.5. The final level of LraCS 
purity was confirmed by matrix-assisted laser desorption/ionization- 
time of flight mass spectrometry. In preparation for crystallization, 
the sample was dialyzed into 10 mM Tris-HCl, 100 mM NaCl pH 7.8 
and concentrated using a Vivaspin 20 (Sartorius) to provide a stock 
solution for crystallization. A theoretical extinction coefficient of 
16 180 M" 1 cm" 1 at 280 nm was used to estimate protein concentra- 
tion (ProtParam', Gasteiger et al, 2005); the theoretical mass of one 
subunit is estimated as 35.6 kDa. 

Crystallization was achieved at 293 K using the hanging-drop 
vapour-diffusion method with 0.75 ul protein solution at a concen- 
tration of 10 mg ml -1 mixed with 0.75 ul reservoir solution consisting 
of 7.5% PGA-LM (y-poly-D-glutamic acid low molecular weight) and 
19% PEG 3350 (polyethylene glycol average mass 3350) in 0.1 M 
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Table 1 

Data-collection and refinement statistics. 

Values in parentheses are for the highest resolution bin of approximate width 0.1 A. 
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structure factor. § R tree is the same as ^ wor k, except calculated using 5% of the data that 
were not included in any refinement calculations. ^| Ramachandran analysis from 
Coot. 



Tris-HCl pH 7.8. Crystals grew over a period of 2-3 d to approximate 
dimensions of 50 x 50 x 250 um and were characterized in-house 
using a Rigaku HF007 rotating-anode X-ray generator coupled to an 
R-AXIS IV ++ image-plate detector. The presence of PGA-LM and 
PEG 3350 in the mother liquor allowed the crystals to be cooled to 
approximately 103 K in a stream of gaseous nitrogen without addi- 
tional cryoprotection. The crystals were orthorhombic and belonged 
to space group P2 1 2 1 2 1 , with unit-cell parameters a = 48.96, b = 86.3, 
c = 134.0 A. Suitable crystals were stored in liquid nitrogen for 
subsequent data collection at the European Synchrotron Radiation 
Facility (ESRF), Grenoble, France. 

2.2. X-ray data collection, processing, structure solution and 
refinement 

A well formed sample was selected and diffraction data were 
measured on beamline ID23-2 at the ESRF using a MAR 225 CCD 
detector. Data were indexed and integrated using XDS (Kabsch, 

2010) and scaled using SCALA (Evans, 2006); the statistics are 
summarized in Table 1. Diffraction data were collected from a single 
crystal at a wavelength of 0.87260 A. The search model for molecular 
replacement was prepared from the E. coli cysteine synthase B 
structure (PDB entry 2bhs; Claus et al, 2005). The sequence identity 
between the search model and LmCS is 39%. Pruning and mutation 
of this model was carried out using CHAINSAW (Stein, 2008). 
Molecular replacement was performed in MOLREP (Vagin & 
Teplyakov, 2010) using a monomer from 2bhs to search for two 
molecules in the asymmetric unit. A dimer was located, giving a score 
of 0.396. Refinement was performed in REFMACS (Murshudov et al, 

2011) and was alternated with rounds of electron-density and 
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difference density map inspection and model manipulation together 
with water and ligand incorporation using Coot (Emsley & Cowtan, 
2004). MolProbity (Chen et al, 2010) was used to investigate model 
geometry in combination with the validation tools provided in Coot. 
Final model analysis was performed using JCSG Quality Control 
Check (http://smb.slac.stanford.edu/jcsg/QC/). Crystallographic 
statistics are presented in Table 1. Analyses of surface areas and 
interactions were made using the PISA server (Krissinel & Henrick, 
2007) and the figures were prepared with PyMOL (DeLano, 2002). 
Amino-acid sequence alignments were carried out using the program 
MUSCLE (Edgar, 2004). 



2.3. Biochemical analysis 

For biochemical analysis, recombinant LmCS was expressed and 
purified as a C-terminally His-tagged protein as described previously 
(Williams et al., 2009). The A. thaliana OAS-TL gene was subcloned 
from pET3dAtOASTL into pET21 and the recombinant protein 
^4/OAS-TL was expressed and purified in the same way as LmCS. 

CS activities were determined at room temperature in 100 ul 
200 mM potassium phosphate, 1 ml EDTA, 0.2 mM PLP, 1 mg ml" 1 
BSA, 3 mM OAS, 2 mM sodium sulfide pH 7.8 with 8 ng LmCS or 
12 ng A/OAS-TL. The reaction was started by the addition of sodium 
sulfide after incubation of all other components for 5 min. Samples 
were taken before addition of sodium sulfide (0 min) and then every 
2 min for 10 min; the cysteine produced was quantified using the azo- 
dye method described previously (Williams et al., 2009). The rates of 
cysteine production were linear for 10 min and the specific activities 
obtained for LmCS and ,4/OAS-TL were 180 ± 18 and 130 ± 
20 umol min -1 mg -1 , respectively. C-terminal SAT peptides are 
known to bind to the active sites of the plant OAS-TL (Francois et al., 
2006) and bacterial OASS enzymes (Huang et al., 2005) and a peptide 
DFSI based on the SAT sequence is a competitive inhibitor of 
M. tuberculosis OASS (Schnell et al., 2007). Thus, peptides based on 
the A. thaliana and L. major SATs and the PGA bound in the crystal 
of LmCS were tested as inhibitors of the enzyme. Inhibition data 
were determined by adding various concentrations of different 
peptides to the pre-incubation mixture and then measuring the 




Figure 1 

A ribbon diagram of the LmCS dimer. The helices and strands of subunit A are 
coloured green and brown, respectively, and in subunit B they are coloured cyan 
and blue, respectively. The position of the disordered loops A21A-A2A1 and 5213- 
5241 are marked by dotted lines. The C- and N-terminal positions are labelled, as 
are the domains. The two molecules of y-poly-D-glutamic acid are depicted in stick 
form, with C positions coloured yellow, O positions red and N positions blue. 



enzyme activity. IC 50 curves were obtained using GraFit 5 (Erathicus) 
by plotting the initial rates measured with at least six different 
concentrations of the peptide. All IC 50 values are the means ± 
standard deviations of three independent determinations, unless 
otherwise stated. The kinetics of inhibition by the tetrapeptide DYVI 
were investigated by measuring the initial rates of LmCS without the 
peptide and then with four different concentrations of peptide (10- 
100 uM) and six different concentrations of OAS (2.5-20 mM) at a 
fixed concentration of 2.0 mM sodium sulfide. The type of inhibition 
was determined from the pattern of the double-reciprocal plots of 1/V 
against 1I[S] for the different peptide concentrations. The K { was 
determined by replotting the slopes against the peptide concentration 
which for competitive inhibition is linear, with the intersect on the x 
axis representing — K v 

3. Results and discussion 

3.1. General comments and overall LmCS structure 

The structure of LmCS was determined to a resolution of 1.8 A. 
The biologically active unit, a dimer, constitutes the asymmetric unit 
(Fig. 1). Subunit A contains residues 3-213 and 241-333, whilst 
subunit B comprises residues 4-214 and 241-333. A surface loop from 
residues 214 to 241 is disordered and is therefore missing from the 
model. The LmCS subunit contains two domains. The smaller domain 
I is constructed by residues 51-158, which primarily form a four- 
stranded /3-sheet surrounded by four a-hefices. The larger domain II 
comprises residues 21-50 and 159-306. Domain II contains four 
a-helices and six /3-strands which, together with a /3-strand contrib- 
uted from the partner-subunit domain I, form a seven-membered 
/3-sheet. In addition, residues 307-333 at the C-terminus form an 
extended helix-loop-helix structure that stretches across the surface 
of the partner subunit. This extension is positioned on the opposite 
face of the dimer to that of the /3-sheet intersubunit interaction. These 
two areas make major contributions to the area of the dimerization 
interface, which constitutes 22% or 3280 A 2 of the surface area of 
each subunit. 

The enzyme purified from the E. coli expression host was cata- 
lytically active and displayed a yellow colour. Both observations are 
consistent with the presence of the PLP cofactor. In addition, PLP 
was added prior to crystallization, seeking to ensure full occupancy. 
However, the crystals were colourless and there was no electron 
density to indicate that PLP was present. The affinity of the crystal- 
lization agent PGA-LM to bind to LmCS may contribute to the loss 
of PLP that is observed and the position of a loop formed by residues 
181-190 is likely to be a consequence of the absence of the cofactor. 

3.2. Binding of y-poly-D-glutamic acid 

A fragment of the crystallization agent PGA-LM is bound in an 
ordered fashion to the same region of both subunits of the LmCS 
dimer. PGA is a pseudopeptide comprising D-glutamic acid residues 
linked through the amide N atom and the y-carboxy O atom of an 
adjacent unit. The use of this compound in protein crystallization was 
highlighted by Hu et al. (2008). 

In subunit A PGA-^4 comprises five D-glutamic acid moieties 
(Fig. 2), while PGA-i? consists of three D-glutamic acids bound to 
subunit B (data not shown). The first and second glutamate moieties 
of PGA-^4 overlap with the second and third glutamates of FGA-B 
(data not shown). PGA-^4 is extended by three additional moieties at 
one end, while FGA-B has an additional moiety at the other end of 
the ligand. The interactions between carboxyls from PGA and the 
side chain and main chain of Thr83 and the main-chain amides of 
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Asn82, Ser274 and Phe273 are common to both binding sites. Also 
involved in binding FGA-A are Leu312, Ala311 and Serl07 (Fig. 3), 
while binding of FGA-B also involves Ser78, Ser80, ArgllO and 
ThrlOl (data not shown). 

3.3. Comparisons with /4fOAS-TL and the PLP-binding site 

LmCS shows a high level of sequence identity to other O-acetyl- 
serine sulfhydrylases in the PDB. Analysis of the structural conser- 
vation using DALI (Holm & Rosenstrom, 2010) revealed the highest 
similarity to be to AfOAS-TL (PDB entry lz7w; Bonner et al, 2005), 
which shares 47% sequence identity. The superimposition of subunits 
with LmCS gives an r.m.s.d. of 0.9 A over 285 C a residues. Differ- 
ences between the structures are primarily restricted to loops posi- 
tioned around the active site. The region from Aspl51 to Tyrl57 in 
LmCS forms the start of al, while in ^4/OAS-TL this helix is trun- 
cated (Fig. 4). The conserved motif QFJlNPJlN that is present in the 
vast majority of OAS-TL sequences is replaced by QFATKYN in 
LmCS; this replacement is also found in L. braziliensis , L. infantum 
and Trypanosoma cruzi. The first residue of this motif, glutamine, 
is normally directed towards the active site, although it remains too 
distant to interact directly with the cofactor. Alteration to the TKY 
motif causes restructuring of this region, extending the helix that 
normally follows the motif by an extra two turns. This has two effects. 
Firstly, the glutamine (Glnl52) residue is placed on the opposite face 
of the helix, far removed from the active site. In addition, the 



phenylalanine (Phel53) is also positioned away from the entrance to 
the active site. In LmCS, the placement of these two residues ensures 
that the active site is considerably widened with respect to that found 
in orthologues of known structure. 

The high level of structural conservation between LmCS and the 
existing structures of O-acetylserine sulfhydrylases is such that the 
expected binding position for the cofactor PLP can be reliably 
derived. A lysine (Lys51 in LmCS) forms a Schiff base with PLP, with 
a conserved asparagine and serine, Asn82 and Ser274 in LmCS, 
forming hydrogen bonds to PLP. Further residues predicted to orient 
and hold the PLP in position are Glyl86, Thrl82, Glyl83 and Thrl85 
of LmCS, which are strictly conserved as Glyl81, Thrl87, Glyl88 and 
T190 in ArOAS-TL. Structural differences are observed between 
LmCS and ^IrOAS-TL in the loop formed by residues 184-190. This 
loop is glycine-rich; it is therefore likely to be mobile and its position 
in LmCS is probably influenced by the absence of PLP from the 
active site. 

The binding of PGA may have contributed to the absence of PLP 
and the unresolved loop between residues 214 and 241 in LmCS 
(Fig. 4). Again comparing LmCS with ^4/OAS-TL, it is expected that 
this loop would fold down and bind into the same groove on the CS 
surface as that occluded by PGA (Fig. 4). Closer analysis of the 
position of this loop in ^4*OAS-TL and comparison with the LmCS 
structure reveals that PGA interacts with Ser274 (Fig. 3) and 
supplants the interactions normally expected to form when PLP is 
bound in the active site (Fig. 4). The position of the side chain of the 




Figure 2 

Stereoview of the PGA fragment bound to subunit A. An F Q — F c OMIT difference density map is shown, where F G are the observed and F c are the calculated structure 
factors derived from the crystallographic model excluding the contributions from PGA atoms. The map is contoured at 3a (blue chicken wire) and PGA is shown as a stick 
model with C positions coloured yellow, O positions red and N positions blue. The protein is depicted in a green ribbon format. The D-glutamic acid units are numbered 1-5. 




Figure 3 

Stereoview of the binding of FGAA to LmCS. PGA is shown as in Fig. 1 and LmCS is shown in green, with N and O atoms of specific side chains coloured blue and red, 
respectively. Water molecules are shown as cyan spheres and hydrogen-bonding interactions are depicted as yellow dotted lines. 
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conserved ArgllO changes considerably as it forms interactions with 
PGA, whereas normally it would be expected to interact with the 
highly conserved Gln224 when the 214-241 loop closes over the 
active site. 

3.4. Biochemical analysis 

Following the initial observation of PGA binding in the active site, 
we tested a (y-Glu) 4 derivative as a potential inhibitor. Despite 
testing up to a concentration of 500 \xM, we were unable to detect any 
significant inhibition (data not shown). This may relate to the finding, 
described above, that PGA binding to LraCS does not mimic that of 
peptides to the active PLP-containing enzyme and may simply be a 
consequence of the high level of PGA present in the crystallization 
conditions. 

Peptides corresponding to the C-terminus of the L. major and 
A. thaliana SATs were also tested as inhibitors of LraCS (Table 2). 
The results obtained for the plant SAT peptide (DYVI), which 
inhibited LraCS with an IC 50 of 7 \xM and displayed a similar activity 
towards A. thaliana OAS-TL, are presented in Fig. 5. Surprisingly, the 
Leishmania C-terminal peptide GSGI was only a weak inhibitor, with 



Table 2 

Inhibition of LmCS by peptides corresponding to the C-termini of the L. major and 
A. thaliana SATs. 

Results in bold represent the mean ± standard deviation of three independent 
determinations. All other results are from a single IC 50 curve. ND, not determined. 

A f OAS-TL IC 50 (\iM) LmCS IC 50 (|iM) 



Peptides 

DYVI 8 ± 3 7 ± 1 

GSGI <2000 1500 

DGSGI 170 620 ± 50 

EGDGSGI 240 270 ± 70 



N-blocked amino acids 

L-Isoleucine ND >4000 

Af-Benzyl-L-isoleucine ND >1000 

jY-3-Indolylacetyl-L-isoleucine ND 260 

N-CBZ-L-isoleucine ND 320 

N-CBZ-L-valine-isoleucine ND 1410 



an IC 50 of approximately 1.5 mM. Longer peptides derived from the 
Leishmania SAT sequence had greater activity, with the C-terminal 
heptapeptide (EGDGSGI) inhibiting LraCS with an IC 50 of 270 uM. 
Similar results were obtained when the Leishmania SAT peptides 
were tested on AtOAS-TL. These findings are consistent with data 





Figure 4 

Stereoview of the PGA-A-LraCS complex overlaid with a peptide-^4 /OAS-TL complex. The LmCS structure is shown as in Fig. 3, with the locations of al and the TKY 
motif indicated. The model of the y^OAS-TL-peptide complex (PDB entry 2isq; Francois et al, 2006) is coloured purple. The peptide sequence corresponds to that of the 
C-terminal residues of SAT. Hydrogen-bonding interactions are depicted as dashed lines coloured according to the structure in which they occur. 





, # 100 uM 



50 \lM 

20 uM 

10 uM 
-X 0 fiM 

10 10" ^ / 0 0 0.1 0.2 

DYVI(uM) X J \/[S] 

(a) (b) 

Figure 5 

Inhibition of LmCS by the synthetic peptide DYVI. (a) Dose -response curve. Initial rates were determined in reactions containing 3 mM OAS and eight different 
concentrations of the peptide DYVI (0.1-400 \xM). The graph shows percentage activity plotted against concentration of the peptide and the IC 50 curve fitted with GraFit 5. 
(b) Double-reciprocal plots of initial velocity against substrate concentration showing competitive inhibition. Reactions contained four different concentrations of the 
peptide (10-100 uM) with six different concentrations of the substrate OAS (2.5-20 mM). Lines were plotted for each peptide concentration by linear regression using 
Microsoft Excel. Reactions contained 100 uM DYVI (closed circles), 50 uM DYVI (open circles), 20 uM DYVI (closed triangles), 10 uM DYVI (open triangles) or no 
inhibitor (crosses). The inset is the secondary plot of slope values against the concentration of DYVI. 
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from the mutational analysis of SAT from E. coli, which showed that 
both length and the presence of negatively charged residues were 
important for complex formation (Zhao et al, 2006). Attempts to 
cocrystallize LmCS with DYVI were unsuccessful; thus, the binding 
could not be analysed further. 

Heterologous binding of divergent SAT peptides has been 
reported previously (Campanini et al, 2005; Francois et al, 2006). The 
K { of the A. thaliana SAT peptide DYVI for LmCS was determined 
by measuring the activity with different concentrations of OAS and 
the peptide at a fixed concentration of sodium sulfide. The double- 
reciprocal plots showed that the apparent K m increased with 
increasing peptide concentration, but the V max was not altered and 
the secondary plot of slope against peptide concentration was linear 
(data not shown). These results indicate that the heterologous SAT 
peptide DYVI is a competitive inhibitor of LmCS, with a K{ of 4 [iM , 
similar to the inhibition reported for the M. tuberculosis enzyme by 
its cognate SAT peptide (Schnell et al, 2007). 

The carboxylate group of the invariant C-terminal isoleucine 
provides an anchor for peptide binding and forms hydrogen bonds 
between key active-site residues that also bind the substrate OAS. 
These interactions are highly conserved between species (Francois et 
al, 2006; Schnell et al, 2007). These are in part conserved in the PGA 
binding through Thr83, but the carboxylate lies too distant from 
Ser79 to conserve the second interaction (Fig. 3). 

Given the importance of the C-terminal isoleucine residue, the free 
amino acid and certain N-blocked derivatives were also tested for 
inhibition (Table 2). Isoleucine itself had no activity at 4 mM, whereas 
two of the N-blocked derivatives inhibited with IC 50 values of 
250 [iM. This increased activity could be the result of the removal of 
the positively charged amino group or the addition of the hydro- 
phobic blocking group. The carboxybenzyl (CBZ) blocked dipeptide 
CBZ-L-valine-isoleucine showed a greatly reduced activity compared 
with other blocked amino acids, indicating that a valine at this posi- 
tion of the peptide may not be optimal. 

The relatively weak activity of the Leishmania SAT peptides for 
competitive inhibition of LmCS raises questions as to whether CS 
and SAT could form a functional protein-protein complex in this 
organism. Differences in the dissociation constants observed for plant 
and bacterial cysteine synthase complexes (CSCs) were thought to 
result from differences in the affinity of the SAT C-terminus for the 
CS active site (Wirtz et al, 2010). However, complex formation in 
plants and bacteria is now known to involve conformational changes 
in CS that are not induced by binding of the C-terminal SAT peptide 
alone (Campanini et al, 2005; Salsi, Campanini et al, 2010; Wirtz et 
al, 2010; Kumaran et al, 2009). Additional interactions between SAT 
and CS appear to be required. A sequence motif that was first 
identified in ^UOAS-TL has been implicated in complex formation 
with SAT by mutagenesis of several conserved basic residues (Lys217, 
His221 and Lys222) in AOAS-TL (Bonner et al, 2005), and mutation 
of the corresponding residues (Lys222, His226 and Lys227) in LmCS 
also prevented complex formation when CS and SAT were co- 
expressed in E. coli (Williams et al, 2009). These residues are located 
on the disordered 213-241 loop in LmCS and are therefore well 
placed to bind a partner protein at the active site. Although it has 
been predicted on the basis of molecular modelling that the 
C-terminal sequence of Leishmania SAT would not be able to bind to 
the CS active site (Marciano et al, 2010), our inhibition data, and 
the conservation of basic residues shown to contribute to complex 
formation, suggest that L. major SAT and CS could indeed interact. 
The relatively weak initial interaction with the C-terminal isoleucine 
might precede conformational changes that support complex 
formation as observed in the assembly of bacterial CSCs (Salsi, 



Bayden et al, 2010; Wang & Leyh, 2012). Further work would be 
required to investigate this aspect of LmCS function, although the 
presence of PGA in the active site and the loss of PLP cofactor 
indicate that these crystallization conditions and the crystal form 
obtained in this study are unsuited to a study of LmCS complexes. 
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